Nationality Word Graph for Fast Information Retrieval
نویسنده
چکیده
This paper presents a new technique for determining the information associated to the nationality words. The method makes use of a compressed search graph, namely directed acyclic word-graph (DAWG) and our introduced refinement technique. The former allows us to search quickly a nationality word in only one time scanning. The latter is introduced in our work because a DAWG cannot determine information to each key uniquely. The refinement is a set of simple rules. The if-parts are just character comparison of the terminal states of DAWG. The then-part provides the linguistic values. Our approach has been applied to the nationality words in English for being used via Internet in the task of Who is Who?
منابع مشابه
Cross-Lingual Word Representations via Spectral Graph Embeddings
Cross-lingual word embeddings are used for cross-lingual information retrieval or domain adaptations. In this paper, we extend Eigenwords, spectral monolingual word embeddings based on canonical correlation analysis (CCA), to crosslingual settings with sentence-alignment. For incorporating cross-lingual information, CCA is replaced with its generalization based on the spectral graph embeddings....
متن کاملبررسی تأثیرات ریشهیابی در بازیابی اطلاعات در زبان فارسی
Using the language-specific behavior in information retrieval systems can improve the quality of the retrieved results significantly. Part of the word that remains after removing its affixes is called stem. Stemming process can be used for improving the relevancy of the results in information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...
متن کاملIntellectual Structure of Knowledge in Information Behavior: A Co-Word Analysis
Background and Aim: The intellectual structure of knowledge and its research front can be identified by co-word analysis. This research attempts to reveal the intellectual structure of knowledge in information behavior inquiries, via co-word, network analysis, and science visualization tools. Methods: Bibliometric methodology and social network analysis are used. Population comprises 2146 recor...
متن کاملGraph-based Algorithms for Natural Language Processing and Information Retrieval
Graph theory is a well studied discipline, and so are the fields of natural language processing and information retrieval. However, most of the times, they are perceived as different disciplines, with different algorithms, different applications, and different potential end-users. The goal of this tutorial is to provide an overview of methods and applications in natural language processing and ...
متن کاملA Unified Graph Model for Sentence-Based Opinion Retrieval
There is a growing research interest in opinion retrieval as on-line users’ opinions are becoming more and more popular in business, social networks, etc. Practically speaking, the goal of opinion retrieval is to retrieve documents, which entail opinions or comments, relevant to a target subject specified by the user’s query. A fundamental challenge in opinion retrieval is information represent...
متن کامل